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Abstract 

Optimal dense coding using a partially-entangled pure state of Schmidt rank D 
and a noiseless quantum channel of dimension D is studied both in the deterministic 
case where at most messages can be transmitted with perfect fidelity, and in the 
unambiguous case where when the protocol succeeds (probability t x ) Bob knows for 
sure that Alice sent message x, and when it fails (probability 1 — t x ) he knows it has 
failed. Alice is allowed any single-shot (one use) encoding procedure, and Bob any 
single-shot measurement. 

For D < D a bound is obtained for in terms of the largest Schmidt coefficient 
of the entangled state, and is compared with published results by Mozes et al. For 
D > D it is shown that is strictly less than D 2 unless D is an integer múltiple of D, 
in which case uniform (maximal) entanglement is not needed to achieve the optimal 
protocol. 

The unambiguous case is studied for D < D, assuming t x > for a set of D D 
messages, and a bound is obtained for the average {I/t). A bound on the average (r) 
requires an additional assumption of encoding by isometries (unitàries when D = D) 
that are orthogonal for different messages. Both bounds are saturated when t x is a 
constant independent of x, by a protocol based on one-shot entanglement concentration. 
For D > D it is shown that (at least) D 2 messages can be sent unambiguously. 

Whether unitary (isometric) encoding suffices for optimal protocols remains a major 
unanswered question, both for our work and for previous studies of dense coding using 
partially-entangled states, including noisy (mixed) states. 

I Introduction 

Dense coding is an intriguing nonclassical effect made possible by entangled quantum 
states: combining entanglement with a quantum channel allows more information to be 
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transmitted than is possible using these resources separately. The original protocol of Ben- 
nett and Wiesner pQ can be summarized, in slightly altered notation, as follows. Alice and 
Bob share two D-dimensional partides, meaning that each is described in quantum terms 
using a D-dimensional Hilbert space, which are initially in a fully-entangled state. Alice 
carries out one of D 2 mutually-orthogonal unitary encoding operations on her particle and 
sends it to Bob through a perfect D-dimensional quantum channel. Bob measures the quan- 
tum state of the two-particle in a fully-entangled orthonormal basis in order to learn with 
certainty which of the D 2 operations Alice carried out. This protocol can transmit D 2 "clas- 
sical" messages with perfect fidelity, corresponding to a classical channel of capacity 21og£). 
For a careful and mathematically precise discussion of what we shall hereafter refer to as the 
Standard protocol, see [2] . 

If the two partides are in a partially, as opposed to fully, entangled state, can dense 
coding still be carried out, and if so, how many messages can be transmitted? How must 
the Standard protocol be modified in order to accomplish this? Even if D 2 messages cannot 
be sent with certainty, are there probabilistic protocols which allow significantly more infor- 
mation to be transmitted than the log D capacity of the quantum channel by itself ? These 
qüestions have led to a significant body of research. The present paper addresses them in 
two particular cases. 

The first is deterministíc dense coding, where the aim is to send L distinct messages 
with perfect fidelity, and a significant problem is to determine the maximum value Ld of L 
for a given partially-entangled state. The answer is known for a uniformly entangled state, 
our term for a state in which all the nonzero Schmidt coeflicients are identical, when the 
Schmidt rank D is less than the channel dimension D: an appropriate modification of the 
Standard unitary encoding protocol allows the transmission of Ld = DD messages. For 
other situations few exact results are available, though a number of interesting numerical 
and analytical results have recently been published by Mozes et al. j3J- Our contribution 
to this tòpic consists in part in raising the question, which we are unable to answer, as to 
whether unitary (or isometric, see Sec. |H} encoding is sufficient to achieve the maximum 
value Ld- In the case of D less than D we derive a rigorous inequality for Ld which holds for 
a general encoding protocol, unitary or not, and compare it with some of the results in [3]. 
We also explore, in a preliminary way, the situation when D (the Schmidt rank) is larger 
than D (the dimension of the quantum channel), for which unitary encoding is impossible, 
and show that Ld is strictly less than D 2 unless D is an integer múltiple of D. 

The second case we consider is unambiguous dense coding: when Alice encodes message 
x, Bob's measurement will with probability r x teli him precisely which message Alice sent, 
and with probability 1— r x that the protocol has failed. A significant problem is to determine 
the maximum average probability of success P s = (r) for some set of L messages. This will 
depend on the choice of L, with P s decreasing, for a given entangled state, as L increases. 
We consider the case L = DD, the maximum number of messages that can be sent in 
unambiguous fashion for D < D, and derive a bound for the the average inverse probability 
of success {í/t), assuming r x > for every message. We also obtain a bound for (r) when 
encoding is carried out using orthogonal isometries (orthogonal unitàries in the case D = D). 
Both bounds are saturated in the special case in which t x is independent of x by a protocol 
which employs unambiguous entanglement concentration. There are many other cases one 
might wish to consider, and for these our bounds are less useful. In particular, the situation 
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when the number L of messages with t x > is less than DD is hard to analyze, because 
one cannot be sure that unitary (or isometric) encoding is the optimal strategy. Indeed, the 
issue of determining when unitary encoding is optimal remains a major unanswered question 
in studies of dense coding using partially entangled states. In our opinion it deserves a lot 
more attention than it has hitherto received. We can say little about unambiguous protocols 
for D > D aside from showing that it is always possible to send D 2 messages with positive 
probability. We suspect this is the maximum possible number, but we have no proof, nor 
can we identify an optimal protocol. 

The remainder of this paper is organized in the following order. Section |H] introduces 
our notation for entangled states, encoding operations, and measurement POVMs appro- 
priate for deterministic and unambiguous protocols. Next we derive, in Sec. 1111} some very 
general information-theoretic bounds, which are later compared with our inequalities based 
on Schmidt coefficients of the entangled state. Deterministic dense coding is the tòpic of 
Sec. IIVI first a review in IIV Al of the case of uniformly-entangled states for D < D, next 
in llV Bl a rigorous inequality for L d , followed in llV Cl bv a discussion of what happens for 
D > D, and in II V Dl by a remark on protocols that achieve L^. Our discussion of unam- 
biguous dense coding for D < D begins in Sec. |V] with the derivation of the inequality for 
(1/t), and continues in Sec. IVII where additional results are obtained assuming encoding 
using orthogonal isometries. We derive a bound on (r) in IVI Al and show in IVI Bl that 
such encoding is optimal when r x is independent of x for a set of DD messages. However, 
the bound on (r) given in IVI Cl is unlikely to hold if one allows nonorthogonal isometries. 
Unambiguous dense coding in a situation with D > D is the subject of Sec. IV11I 

Section IVI III contains material relating our work to previous research: uniform entan- 
glement for D < D in IVIH Al deterministic dense coding with reference to jïïj in IVIH Bl 
previous work on unambiguous dense coding in IVIII Cl the connection of unambiguous dense 
coding with unambiguous discrimination in lVlll Dl and finally, the issue of unitary encoding 
when carrying out dense coding using noisy (mixed) entangled states, in lVlll El The con- 
cluding Sec. IIXI contains a summary of our results followed by a discussion of open qüestions. 
Two appendices contain technical results. 

II General Framework 

The general setup we shall be studying is shown schematically in Fig. ^ Alice and Bob 
share a normalized entangled state 



of Schmidt rank D on the tensor product 7i a ® T~tb of two Hilbert spaces of dimension d a 
and db, with orthonormal bases and {(fr 7 )}. We assume the Schmidt coefficients are 

ordered from largest to smallest, 



and Xj = for j > D. One can visualize the situation by assuming that 7í a and Tib refer to 
two partides, one in Alice's and one in Bob's possession. Our analysis is simplified through 



D 




(1) 



Ai > A 2 > • • ■ A s > 
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assuming that 

d a = d b = D, (3) 

which is to say |$) is of full Schmidt rank on 7í a ® Tíj,, but nothing essential would change 
if d a or db had vàlues larger than D. In addition, Alice can signal Bob through a perfect 
.D-dimensional quantum channel, where D does not have to be the same as D. 




Figure 1: Encoding and measurement for a dense coding protocol. 

Alice wishes to transmit one of L messages labeled x — 1, 2, ... to Bob, and to do so she 
encodes her message by carrying out a unitary map W x from 7í g ®7í a to 7í/i ® 7í c , where TC g 
refers to an ancillary particle in a pure state \g°), H.h to the final ancillary particle, and 7i c 
is a Hilbert space of dimension d c = D, thought of as a particle which is then sent through 
the noiseless channel to Bob. Since W x is unitary, d g d a = dhd c , but d a = D need not be the 
same as d c = D. By introducing an orthonormal basis {\h )} for Tí^, we can express the 
action of W x in the form 

W x (\g°)®\a)^ =J2\ hl )® ( A M), (4) 
i 

where the A x i, which are known as Kraus operators, map 7i a to 7í c and satisfy the normal- 
ization condition 

E = i~ ( 5 ) 

i 

In the special case in which there is only a single term l = 1 in this sum, we will omit the 
subscript / and refer to the map A x of 7í a to 7í c as an isometry, since A\A X = I a means that 
A x preserves norms. An isometry is only possible when d a = D < D = d c , and if D — D 
the isometry is a unitary operator. In this sense isometric encoding represents a natural 
generalization of unitary encoding in the Standard protocol. To be sure, when D is less than 
D one can always suppose that d a = D and that |<E>) is supported on a subspace of H a , so 
that the particle c sent through the channel is identical with the particle a initially in an 
entangled state. However, we find it more convenient to carry out the analysis assuming 
d a = D. If D is larger than D, one must assume different dimensions for H a and TC C , and 
isometric encoding is not possible. 

Bob's task is to extract information by carrying out a POVM using a collection {B y } of 
positive operators on Hc'&'Hb, as indicated schematically in Fig.[T] For studying unambiguous 
dense coding it is convenient to assume that the label y can take on vàlues 0, 1, 2, ... , with 
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the significance that if the outcome is y = x > 0, Bob knows for sure that Alice sent message 
x, while y = is the "failure" or "garbage" outcome: he does not know which message was 
sent. As is well known, Bob's POVM can always be thought of as a projective measurement 
on 7i c Cg> 'Hb along with an ancillary system prepared in a pure state, and the reader may 
wonder why we have not included this ancillary system as part of Fig. ^ The answer is that 
it is not needed for our analysis, whereas the details of Alice's encoding procedure play a 
more significant role in our discussion. 

The framework outlined above for encoding and decoding is also appropriate, given some 
obvious modifications, for the case in which Alice and Bob share a noisy entangled state, 
represented by a density operator or ensemble, or use a noisy channel. But our entire 
discussion is limited to "one shot" dense coding: Alice does not entangle her input over 
many uses of the apparatus, nor does Bob save the outcomes of múltiple transmissions in 
order to perform a coherent measurement. 

An unambiguous dense coding protocol is thus one in which the {W x } — or equivalently 
the {A^} — and the {B y } have been chosen so that 

Pr(y | x) = T x 6 yx + (1 - T x )óyo, (6) 

where t x is the probability that if Alice chooses to send message x it will be correctly 
transmitted: Bob's apparatus will show y = x rather than y = 0, the latter being an 
indication that the protocol has failed. If r] x is the a priori probability for Alice choosing 
message x, the joint probability distribution will be 

Pr(x, y) = T) x [r x 5 yx + (1 - r x )5 y0 ] . (7) 

This means the average probabilities P s of success and Pf of failure in sending a message 
are 

Ps = ^2vxT Xi Pf = l-P s - (8) 

x 

The deterministic case is one in which r x — 1 for 1 < x < L, where L is the number 
of messages under consideration, and an optimal deterministic protocol is one giving rise to 
the maximum number of messages having r x = 1, assuming |$) and D are held fixed. 
An optimal unambiguous protocol is, roughly speaking, one that yields the maximum value 
of P s but this will depend on the a priori probabilities {i] x }. We shall only consider the case 
T] x = l/L. 

III Information Theory Bound 

Before discussing specific protocols in the following sections, it is convenient to derive 
some simple but quite general information-theoretic bounds on the probability of successfully 
transmitting a message from Alice to Bob. The first is based on the result in Sec. VII of 
jlj, that the classical capacity C of a dense coding "channel" (entangled state plus quantum 
channel) of the sort we are considering is given by 

D < D : C = log D + He, H e = - \) log Xf, (9) 

j 
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He is the entanglement of 1$). The condition D < D is implicit in the derivation in JÏJ. For 
D > D we do not know of a comparable expression, but studies of entanglement assisted 
capacity jS] yield an upper bound 

D> D: C < 21ogL>. (10) 

(This also holds for D < D, but then it is obvious from (jüj), since He cannot exceed logD.) 
A consequence of (J7J) is the conditional probability 

ÍS yx for y > 0, 

Tk0--T x ) . n (11) 

"T^T for y = 0, 

from which the Shannon mutual information 

I(X:Y) = H(X) - H(X\Y), (12) 

can be calculated using 

H{X) = -VVlog^, 

V (13) 
H{X\Y) = (l-P a )H(X\y = 0). 

Because of QJ, F(X | j/) = for all y > 0. 

If we restrict ourselves to the situation in which all L messages have the same a priori 
probability, r\ x = l/L, and use the upper bound H(X \ y — 0) < logL in (fTÏÏj) . the fact that 
I(X : Y) cannot exceed C in © leads to the inequality 

P s logL<logD + H E (14) 

with 

P s = (l/L)J2r x (15) 

the average unweighted probability of successfully transmitting a message. 

In certain cases this bound might be improved by choosing a nonuniform set of a priori 
probabilities {r] x }, or using a better upper bound than logL for H(X \ y — 0). However, 
because of their generality, one cannot expect bounds of this sort to be very tight, and in 
the following sections we obtain for restricted types of protocols improved bounds which are 
not based on Shannon mutual information. 



IV Deterministic Dense Coding 

IV A Uniformly entangled state with D < D 

We use the term uniformly entangled for the state |$) in ((T} when all the (nonzero) Xj are 
equal to each other. When D = d a = db this coincides with the terms "fully" or "maximally 
entangled," but neither term seems appropriate when D is smaller. One of the simplest 
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and most straightforward extensions of the Standard dense coding scheme is to a uniformly 
entangled state D < D, which can be used to send exactly DD messages deterministically 
when encoded using orthogonal isometries. 

Let {A x } be a collection of isometries from TL a to 7í c , see the discussion following (jïïj), 
which are orthogonal in the sense that 

Tr a (AÍA y ) = D5 xy = Tr c (^4), (16) 

where one can use either equation as a definition. If we use the orthonormal basis {|a J )} in 
to write each A x in the form 

D 

where the expansion coefficients are elements of H c , the orthogonality condition (fTüj) 
becomes 

D 

J>^> = D5 xy . (18) 
It follows from this that if |$) is uniformly entangled, the kets 

\* x ) = A x \<ï>) = Ç(V>/S) |7Í> ® W) (19) 
3 

on 7í c ® Tíb form an orthonormal collection. Given a uniformly entangled state and a 
collection of DD orthogonal isometries, there is a straightforward deterministic dense coding 
protocol for L = D D messages: Alice uses A x to encode message x, and Bob measures using 
the orthonormal basis {| & x )}- A proof of the intuitively obvious result that such a protocol 
is optimal can be based on (|14j) with He = log-D, or on (J2ò|) below. Of course, when D = D 
the isometries are unitàries, and we are back to the Standard protocol. 

Here is one way to construct DD orthogonal isometries. Let operators Q : TC a — > 7~£ c , 
R : H a — > T~L a , and S : Ti c — > 7í c be defined by 

D 

R\a?) = e 27ri ü- 1)/ü \a j ), S\c k ) = \c k(B1 ), Q = Y, \<?)(°> S \ ( 20 ) 

j=i 

in terms of orthonormal bases {|a· J )}, 1 < j < D, of Tí a and {|c fc )}, 1 < k < D, of 7Í C ; © 
means addition modulo D. Since R and S are unitary, each of the D D operators 

A afl = S f) QR? t , 0<a<D, 0<(3<D, (21) 

is an isometry from 7i a to TC C . Here each distinct double subscript aj3 corresponds to a 
different value of x, and the counterpart of (fTü|) is 

r L· a (A i a gA a ,g,) = DS aaf S P0f . (22) 
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IV B General bound L d < D/\\ 

Whatever operation Alice carries out to encode message x will result in a density operator 
p x describing the combined 7í c ® 7í& system which Bob will measure, see Fig. [TJ and since 
whatever Alice does has no effect on Bob's particle, its reduced density operator is 

Tr c ( P:c )=Tr a (|$)($|)=^A J 2 |6^)^|, (23) 

i 

independent of x. Two density operators p x and p y corresponding to distinct messages x 
and y can only be distinguished with certainty [ü] if p x p y = 0, which is to say their supports 
are orthogonal: P x P y = 0, where P x is the projector onto the support of p x . Consequently, 
since p x < P x in the sense that P x — p x is a positive operator, for a deterministic protocol it 
must be the case that 

L 

^p x <^P x <I c ®h- (24) 

X = l X 

Upon tracing this inequality over Ti c and using (jlHj), one obtains 

LJ2^\V)(V\ < DI h = DJ2\V)(V\, (25) 

3 3 

so that LX 2 < D for every j. Since Ai is the largest Schmidt coemcient of |$), this implies 
that the maximum value of L satisfies 

L d < D/\\. (26) 

As X\ cannot be smaller than 1/D, this inequality implies that cannot exceed DD, a 
bound which is achievable for D < D using a uniformly entangled state, as shown in part A, 
but not for D > D, see part C below. If |$) is a product state, Ai = 1 and the rather trivial 
bound L d < D is achieved by sending one of D orthogonal states through the quantum 
channel. In other situations the bound (J2bj) is less trivial; see, in particular, the discussion 

in sec. ivnrm 

Taking the logarithm of (}2*üj) . one has 

log L d <\ogD + log(l/A?) < log D + H E , (27) 

where the second inequality follows from the defmition in (jÜj), given that Xf > for all j 
and ^2^ = 1. This shows that (}2*üj) is a tighter bound than the information-theoretic (fT4*j) 
with L = L d and P s — 1. 

IV C Protocols for D > D 

The case D > D stands in marked contrast with that for D < D discussed in part A 
above. To begin with, it is impossible to send DD messages in a deterministic fashion, 
because that exceeds the bound D 2 implied by (fTÜj). But even sending D 2 messages is not 
possible unless D is an integer múltiple of D; otherwise, as we shall show, L d is strictly less 
that D 2 . Furthermore, when D is an integer múltiple of D, a uniformly entangled state is 
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not needed to achieve the optimal protocol, though there is still a nontrivial constraint on 
the Schmidt coeficients. 

Let us first discuss the case D = 2, D = 4, assuming |$) is a uniformly entangled state. 
Without loss of generality one can think of this (up to some local unitàries) as Alice and 
Bob sharing two fully-entangled qubit pairs. An optimal dense coding protocol consists in 
throwing away one pair, and carrying out Standard dense coding with the other, in order to 
send one of D 2 = 4 messages. But of course the pair that was thrown away need not have 
been fully entangled, so it is at least sufficient that the Schmidt coefficients be identical in 
pairs: Ai = À2, and À 3 = À4. Indeed, the pair that was thrown away could have been in a 
mixed state. The general case in which D is an integer múltiple of D can be discussed in 
exactly the same way whenever |<E>) can be thought of as the tensor product of one fully- 
entangled D x D pair with something else: by discarding the latter, which could have been 
in any state whatsoever, and using the former for Standard dense coding one achieves an 
optimal protocol. Describing all of this in terms of the Kraus operators and the POVM of 
Sec. Ois an exercise we leave to the reader. 

Next assume that D is not a múltiple of D. Alice must encode a particular message x 
using a collection of Kraus operators {A x[ } satisfying (jSJ). Since A x[ maps a D-dimensional 
space to one of dimension D < D, its rank, which is the same as the rank of A\ { A xl (p. 13 of 
[7j), is at most D. But the identity operator I a in (JSJ is of rank D > D. Therefore encoding 
cannot be achieved using a single Kraus operator, but requires a Kraus rank n (number of 
independent Kraus operators) bounded below by 

k>Ç:=\D/DI (28) 

where \a] is the smallest integer not less than a. This in turn has the consequence, as 
shown in App. [XJ that Alice's encoding results in a state which when traced down to the 
7í c <8> 7~Lb space available to Bob corresponds to a density operator p x of rank greater than or 
equal to k, whose support is therefore a subspace of dimension at least k. As noted above 
in B, two density operators p x and p y can be distinguished with certainty if and only if 
their supports are orthogonal, and this means that the number of messages that can be sent 
deterministically is bounded above by 

L d <p: = [DD/Ç\, (29) 

where [a\ denotes the largest integer not greater than a. 

When D is an integer múltiple of D, p = D 2 , and, as shown earlier, one can achieve 
this value for Ld by using an appropriate |<E>). However, if D does not divide D, p will lie 
somewhere in the range 

D(D + l)/2 < /i < D 2 , (30) 

so Ld is less than D 2 , a result which is tighter than the information-theoretic bound (|1(J|). 
But we do not know whether = p can actually be achieved, even in the simplest case in 
which D — 2, D — 3, for which £ = 2 and fi = 3. That is, we have been unable to design 
a deterministic protocol for transmitting 3 messages, or to show that it is impossible. If for 
this case only 2 messages can be sent deterministically, the entangled state is of no use and 
might as well be thrown away. 
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The argument that produces the bound in (J29|) does not require that |<&) be uniformly 
entangled, but only that it have Schmidt rank D. There are, of course, cases in which 
Nielsen's majorization condition jHj will permit such a state to be replaced with probability 
1 by a uniformly entangled state of rank D, which would allow Ld = D 2 messages to be sent 
deterministically using the Standard protocol. However, the replacement requires both local 
operations and classical communication, which in the dense coding context means a classical 
side channel. That lies outside the scope of the present paper, though it belongs to a class 
of problems worthy of further exploration. 

IV D No extension of optimal deterministic protocol 

Suppose a deterministic protocol is optimal in the sense that 

t x = 1 for 1 < x < Ld, (31) 

with Ld the maximum possible number of deterministic messages for a given |$), D and 
D. It is then not possible to find an unambiguous protocol in which the same number of 
messages can be sent deterministically, and in addition one or more messages can be sent 
unambiguously with probabilities of success less than 1. In other words, (fïïTj) implies that 
t x = for x > L d . 

The argument is straightforward. The density operators (possibly pure states) p x created 
by Alice on H c (g> Hb must be mutually orthogonal, see the arguments in part C above, for 
1 < x < Ld- Were it possible for her to create yet another p' for an additional message 
x = L d + 1, it would have to be orthogonal to all the p x just mentioned, as otherwise 
there would be at least one message in the set 1 < x < Ld which Bob could not defmitely 
distinguish from message L& + 1. But if p' were orthogonal in this way, the additional 
message could also be sent deterministically, contrary to the assumption that Ld is the 
maximum number possible. 

V Saturated Unambiguous Dense Coding 

Whereas unambiguous dense coding is a complicated problem if one allows the most 
general encoding and decoding protocols, the situation is considerably simpler for D < D if 
one supposes that precisely DD messages can be unambiguously transmitted, each with a 
positive (conditional) probability t x . 

To begin with, it is impossible to send more than DD messages, because Bob is carrying 
out measurements on a .DD-dimensional Hilbert space, and very general arguments jïïj pre- 
clude his unambiguously distinguishing more states than the dimension of this space. Even 
to distinguish DD cases unambiguously, each with some positive probability of success, he 
is forced to use a POVM of the form 

B y = \B y ) (B y \ for 1 < y < DD, B = I c ®I b -J2 B y^ (32) 
where each \B y ) is an element of Ti c ®'Hb- 
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For her part, Alice must be able to prepare for each x a pure state \C X ) G 7í c ® Hb, and 
these states, which in general will not be orthogonal for different x, must form a basis of the 
space. In the language of the general encoding protocol, Sec. HH production of pure states 
means that for each x the Kraus rank of the corresponding operation is 1, so one only needs 
a single term in the sum in (J3J). See App. El for the proof of this intuitively obvious result. 
As a consequence, the normalization condition (JüJ becomes 

AxA x = J a , (33) 

where, as in Sec. |Hj we omit the redundant l in the subscript. This means that A x is unitary 
for D = D and an isometry for D < D. In short, unambiguous dense coding of D D messages 
(with, of course, D < D) means the encoding must be isometric (unitary for D = D)\ other 
possibilities are excluded. 

Finally, (jüj) translates into the condition 

(B y \C x ) = (B y \ (A x ® I b ) |$) = ^F x 5 xy . (34) 

Here \C X ) is normalized, since |<£>) is normalized and A x <S>Ib is an isometry, whereas the \B y ) 
states are not normalized, but satisfy the inequality 

J2\B y ){B y \ < I c ® I b , (35) 

which is necessary so that Bq > in (JÏÏ2j) . (One can always choose the phases so that the 
inner products in (}34|) are positive.) 

At this point we find it convenient to reformulate the problem slightly using map-state 
duality (see [101 HU El)- Let us "transpose" (as that term is used in [T2j) A x in the form 
ÍÍT7I) into the ket 

\A X ) = ^2\ií)®\a 3 ) en c ®n a , (36) 

ó 

and | $) in (0) into the nonsingular map Ò : 7i a — * 7~£b defined by 

These transpositions allow one to rewrite (jïï4*j) in the equivalent form 

(B y \(l c ®Ò)\A x ) = ^6 yx . (38) 

A little thought will show that (|3*K|) can only be satisfied for all x and y between 1 and 
DD, with t x > for every x, if the {|v4 x )} are linearly independent and hence form a basis 
of 7í c <8> 'Ha, and likewise the {\B y )} form a basis of 7í c ® H-b- Because {\B y )} is a basis, it 
has a unique dual or reciprocal basis (e.g., Sec. 15 of 13 j) {\B y )} satisfying 

(B y \B x ) = (B x \B y ) = 5 yx . (39) 

Multiplying (jïïHjl on both sides by \B y ), summing over y, and using the fact that 

J2\B y )(B y \ = I b ®I c , (40) 
y 
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yields the expression 

(l c ®è)\A x ) = ^\B x ). (41) 

connecting Alice's operations to Bob's measurements. Similarly, with the dual basis 

to {\A X )}, 

{L®&)\B X ) = ^- X \À X ). (42) 
Since $ has an inverse, one can rewrite (}4~Tj) in the form 

t- 1/2 \A x ) = (icSè" 1 )!^), (43) 

and from this and from 

J2\By){By\ > h® h, (44) 

y>ï 

which is equivalent to (jïï5J), obtain the inequality 

£(1/t x ) • \A X )(A X \ > I c ® ($tè) _1 . (45) 

Tracing both sides over 7í c and using the fact that 

Tr c (\A x )(A x \)=AlA x = I a , (46) 

one arrives at 

(^l/r :E )/ a > J D(l>tè)- 1 , (47) 

X 

which is equivalent to 

{l/r) := (1/DD) ^(l/r.) > (\ 2 D D)~ l , (48) 

since Xfj is the smallest Schmidt coefficient of |$). In particular, if — Pq IS cl constant 
independent of x, (PS|) telis us that the success probability P s = P c is bounded by 

P c < DX 2 D . (49) 



VI Orthogonal Isometries 

VI A Deriving a bound for P s 

As shown in Sec. |VJ if for D < D a full set of D D messages are to be sent unambiguously 
with positive probabilities, Alice's encoding operation must be an isometry. We now make 
the much stronger assumption that the collection {A x } of isometries used for encoding is 
orthogonal in the sense of (|16|). which is equivalent to 

(A x \A y ) = D5 xy (50) 



12 



for the corresponding kets defined in (J3bj) . This means that the elements of the dual basis 
are given by 

\À x ) = {l/D)\A x ), (51) 
and combining this with (|42j) yields the expression 

(l c ®&)(\B x )(B x \)(l c ®è) = (r x /D 2 )\A x )(A x \. (52) 

Sum both sides over x and use the inequality (jïïïïj) to obtain 

J2(rjD 2 )\A x )(A x \ <I C ®&Ò. (53) 

Tracing both sides over 7í c and using (|4T)j) yields the inequality 

'j^T^I a <D 2 D&®. (54) 

X 

Because the smallest eigenvalue of Ò^Ò is A|,, this means that 

P s = (r) = (1/DD) < A sA (55) 



which can be compared with (j48j) . Note that while (j4aj) holds quite generally for saturated 
unambiguous dense coding, the derivation of (fo7)j) requires the additional orthogonality as- 
sumption ()16|) or (|5Ü|) . In the particular case in which t x = P c is a constant independent of 
x, both (f4*%j) and (fo7)j) lead to the same bound (f4*üj). 

The bound (|55|) is tighter for the case L = DD than the information-theoretic bound 
(I14|) . as can be seen by writing the latter in the form 

logD + logD V ; 

As noted in part B below, the entangled state |<£>) can with a probability \\D be transformed 
into a uniformly entangled state with entanglement \ogD by a local operation, and since 
such an operation cannot increase the average entanglement it follows that 



X 2 5 D\ogD<H E . (57) 

Using this and the fact that He cannot exceed logD it is straightforward to show that the 
right side of (j53|) is bounded above by the right side of (jïïïïj). 



VI B Saturating the bound 

In fact, given any collection of DD orthogonal isometries, there is a POVM of the form 
(132}) which results in each message being transmitted with the same probability of success 
t x = X^D, saturating the bound íjïïB]) . The easy way to see this is to imagine Bob carrying out 
his part of the protocol in two steps: an unambiguous entanglement concentration operation 
(the term used in ^2] for what its originators jTü] called the "Procrustean method" ) on his 
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particle, which if it succeeds transforms |$) into a uniformly entangled state with the same 
Schmidt rank, followed by a measurement, in an orthonormal basis of the type described in 
Sec. IIV Al on the combined system of his particle and the one received from Alice. 

Unambiguous entanglement concentration results from Bob carrying out an operation 
[Tol HTj described by Kraus operators 

#i = E(WA;)|&WÏ, K 2 = yJl b -K\K x . (58) 
j 

It is successful if K\ occurs, for which the probability is 

^\K\K X \§) = X 2 6 D, (59) 

and then Alice and Bob share the uniformly entangled state (1/v^D) J^. ja- 7 ) ® \bP). 

The dense coding protocol consists of Alice encoding in the manner indicated in Sec. IIV Al 
and Bob attempting unambiguous entanglement concentration in the manner just described. 
If the latter is successful, Bob carries out projective measurements on the two partides 
as described in Sec. IIV A| certain that the outcome accurately reflects Alice's encoding. 
Obviously, the probability of success (fHÏÏJ) is independent of x. The two steps of entanglement 
concentration followed by projective measurement can be combined into a single POVM of 
the form ()32|) by setting \B X ) = Ki\§ x ), with |$ x ) defined in (|19|). It is then a simple exercise 
to show that the inequality (j35|) is satisfied. 

VI C Exceeding the bound 

The inequality ()55|) was derived by requiring that all DD messages be transmitted with 
positive probability and that the isometries used for encoding be orthogonal, so it is interest- 
ing to ask whether it holds if either condition is relaxed. We believe that both are necessary, 
but have not been able to prové this. Some insight is, however, provided by the following 
considerations. Suppose that D is 3 or more, and the Xj in (JTJ) are all equal, except for 
Xf) = e > 0, which is much smaller than the others. Then Bob can carry out an operation 
analogous to (j58|l . but with 

D-l 

ff 1 = X>)< 6Í l. K 2 = \b D )(b% (60) 
j=i 

which one can think of as a projective measurement to determine whether or not his particle 
is in \b D ). If, with probability 1 — e 2 , K\ occurs, the resulting uniformly-entangled state 
can be used to transmit D(D — 1) messages in a deterministic manner, using the protocol in 
Sec. IIV Al Then (t) as defined in (jB3j) . assuming r x = for x > D(D — 1), is (1— e 2 )(l — 1/D), 
which can obviously be made larger than e 2 D, which is the right side of (J55|) . 

This example violates both of the conditions used to derive (J55|) . since we no longer have 
saturation — r x = for some of the messages — and the isometries used for encoding now map 
a (D — l)-dimensional Hilbert space onto one of D dimensions. While they can be extended 
to isometries acting on the original D-dimensional space 7í a , these isometries will not be 
orthogonal, see App. [BJ Thus the possibility remains open that (J55|) might be vàlid given 
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the assumption of one or the other but not both of the conditions used to derive it, though 
we ourselves doubt that this is the case. By contrast, the inequality (jlHj) is known to hold 
for a saturated protocol, but is obviously useless if some of the t x are zero. 

VII Unambiguous Dense Coding for D > D 

In contrast to the situation discussed in Secs. |S and IVIl in which the Schmidt rank of 
the entangled state is less than or equal to the dimension of the quantum channel, D < D, 
we have very few results for unambiguous dense coding if D is greater than D; in particular, 
we have no upper bounds on success probabilities analogous to those in (jj^j) and (fïïïïj) . 
The situation is not unlike that for deterministic dense coding with D > D as discussed in 
Sec. IIV Cl we have more qüestions than answers. 

There is a simple argument that shows that with D > D it is always possible to send 
D 2 messages in an unambiguous fashion, and one can place lower limits on the probability 
of success. Assume, as previously, that the Schmidt coefficients are arranged in decreasing 
order, (J2J), and define the two projectors 

D D 

Q a = Y,W)^l Q b = J2\v)(v\ (61) 

3=1 J'=l 

on 7í a and Tib- Then {Q a , Q a = I a ~ Qa} an d {Qb, Qb = h~ Qb} form projective decompo- 
sitions of the identities I a and h- If Alice and Bob carry out projective measurements using 
these decompositions, it is evident from that their results will be perfectly correlated, 
and outcome Q a will be accompanied by Qb with probability 

D 

i=i 

When this occurs, Alice and Bob share a partially entangled state of Schmidt rank D, of the 
form (JTJ), but with the summation limit replaced by D, and Xj replaced by Xj = Xjj\/P m . 
This entangled state can be used for unambiguous transmission of D 2 messages and our 
discussion in Secs. El and EU applies, provided the success probabilities calculated using 
the {Xj} are at the end multiplied by P m . In particular we have an overall protocol, by 
combining the projective measurement just discussed with the entanglement concentration 
of Sec. IVI B| in which allows each of D 2 messages to be sent with a probability of success 
equal to DX 2 D P m . This might be optimal for the equal-probability case, but we have no proof 
that it is. 

Alice's projective measurement is actually not necessary in this scheme, since she can 
always proceed as if the measurement would have been successful, and leave it to Bob to 
declare the transmission a failure if his outcome is Qb- On the other hand it will not do 
for Alice alone to carry out the measurement and communicate the result to Bob, since 
this requires a classical side channel (or something similar), and lies outside the scope of 
protocols we are considering. 

Having both parties carry out the projective measurement has the additional advantage 
that if the common outcome corresponds to Q a and Qb, there will still be some entanglement 
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left, if D — D > 2, or at the very least the bare quantum channel when D — D + 1, which 
can be used to communicate some messages. These considerations, while of some interest, 
teli us very little about possible optimal protocols. We do not even know if D 2 is an upper 
bound on the number of messages that can be sent unambiguously when D is greater than 
D, though we suspect this is the case. 

VIII Comments on Previous Work 

VIII A Uniformly entangled states for D < D 

The system of orthogonal isometries for D < D in Sec. II V Al was first proposed, so far as 
we are aware, in Sec. 3 of ^B]; and later worked out independently in (T^j. In both papers 
it is assumed that Alice's and Bob's partides have Hilbert spaces of different dimensions, 
D and D in our notation, and are in a uniformly entangled state of Schmidt rank equal to 
the smaller dimension. The encoding operation is thought of as a unitary carried out on 
the space of higher dimension. While there is nothing wrong with discussing encoding using 
unitàries rather than isometries — see the comments following (jSJ in Sec. [iï] — it can give rise 
to confusion, because while the encoding task dictates the nature of the isometries, their 
extension to unitàries acting on a range space of higher dimension is to a large measure arbi- 
trary. Such confusion may lie behind the incorrect definition of the unitary operator L t and 
the incorrect trace formula (12) in both of which are vàlid (aside from a typographical 
error) for D = D, but not for D < D, and an incorrect expression (11) for the unitary 
operator U mn in jTÏÏ) . 

VIII B Deterministic dense coding 

A very interesting exploration of deterministic dense coding for small systems of partially- 
entangled pure states has been carried out using a combination of numerical and analytic 
techniques by Mozes et al. some of their results extend to general D (in our notation), 
and they make interesting conjectures about the D-dependence of others. They assume 
without discussing the matter that unitary encoding is optimal, but do not assume that 
the unitàries are orthogonal in the sense in which we use that term, see (fTüj) . (The term 
"orthogonal" in their paper has a different meaning.) 

It is of particular interest that our inequality (|26|) . which transcribed to their notation 
reads XqN^^ < <i, is saturated in a number of cases by their numerical or analytical results 
or conjectures, in the sense that there are nontrivial choices of entangled states for which 
this inequality is an equality. These include the right-most limits of the regions 5, 6, and 7 
in their Fig. 1 — the numbers refer to (N m!ÍX ) — for D = 3, and their conjectured minimal- 
entanglement states for = D + n for n = 2, 3, ... D. It is interesting that the = 7 case 
for D = 3 is not included in their general conjectures. 

Another point of agreement between their work and ours is their observation that 
is not in general a monotone increasing function of the entanglement. Our bound on 
depends on À 1; not on the entanglement. 
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VIII C Unambiguous dense coding 

The earliest work on unambiguous dense coding known to us is the study of Hao et al. 
|2Ü] of a partially entangled state of two qubits. (This and and the later [21] use the term 
"probabilistic dense coding.") The paper actually includes two schemes. The first requires 
a classical side channel, but the second does not, and thus fits within the framework of 
our discussion. The encoding scheme employs orthogonal unitàries, and the probability of 
success saturates the bound (jSSjl . in agreement with our Sec. IVI Bl 

More recently Pati et al. [21] have independently worked out the qubit case, with results 
in agreement with (201- They also considered its extension to general D, using a particular 
collection of D 2 orthogonal unitàries, assuming Alice and Bob share a partially entangled 
pure state. Their upper bound for the success probability has now been superseded by our 
()55|) : the latter is both a tighter bound (in some sense the best possible — see Sec. IVI B|) . and 
was derived under weaker assumptions. 

In addition, these authors construct an example in which an increase in entanglement 
brought about by increasing the Hilbert space dimensions of both Alice's and Bob's partides 
can give rise to a lower average probability for transmitting a given collection of messages, 
even when a uniformly entangled state is employed. We believe it is best to think of this 
somewhat counterintuitive result as arising from the encoding scheme they propose for the 
larger system; in particular, its unitàries are no longer orthogonal. While there is no reason 
to suppose that greater entanglement will always improve a dense coding scheme — see the 
comments in j3] regarding the deterministic case — we think the main lesson to be drawn 
from the example considered in [21] is the importance of paying attention to the encoding 
process, not just optimizing Bob's measurements. 

VIII D Unambiguous state discrimination 

Unambiguous dense coding is related to unambiguous state discrimination, see [ü] 1221 EU] , 
in the sense that Bob's measurement task is to distinguish the two-particle states {|C X )}, 
see ([34JI . in an optimal fashion. In the case of dense coding these states are somewhat 
special in that they all correspond to the same reduced density operator on the Hilbert 
space Tíb of Bob's particle. In addition, whereas in unambiguous state discrimination one is 
generally concerned with optimal discrimination of a set of states thought of as simply given 
in advance, in the dense coding case optimization involves Alice's choice of operations for 
producing the {|C X )} as well as Bob's choice of a POVM to distinguish them. 

Some of our results are related to previous work on unambiguous state discrimination in 
the following way When the success probabilities of state discrimination are required to be 
equal, Chefles in [Ü] derived an optimal average success probability consistent with our ([49)1 . 
Also if in Sec. IVII when D = D the orthogonal unitary operators are of the special form 
used in [21], then the states to be distinguished are divided into D mutually orthogonal sets, 
and the states in each set are linearly independent and symmetric. Therefore, the solution 
given in [22] for the optimal discrimination of symmetric states can be used to obtain the 
maximum average success probability in (|55j). However, our result is more general in the 
sense that it does not depend on any specific form of orthogonal unitàries. 
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VIII E Noisy entangled states 



While dense coding using noisy (mixed) entangled states lies outside the scope of the 
research reported in this paper, there is one feature of the studies of this problem in |24| I25| 
I2ül l2~7| |2Hl 1213 EO! EH] to which we wish to draw attention. These papers arrive at a rather 
simple formula 



for the optimal asymptotic classical capacity, with D the dimension of the noiseless quantum 
channel, p the density operator of the initial entangled state, p& its partial trace down to 
Bob's particle, and S the von Neumann entropy. This result is derived assuming that Alice 
is restricted to unitary encoding of messages, whereas Bob is allowed, and in general must 
employ, the most general decoding operation, including coherent measurements on states 
resulting from múltiple transmissions. 

There is no reason why S(p) cannot be larger than S(pb) — an extreme example is a 
maximally-mixed state — and if that is the case, (J63|) cannot be the optimal capacity, since 
C = log-D is always possible by throwing away the entangled state and using the quantum 
channel in a straightforward way to transmit D messages. To be sure, it is conceivable that 
(JïïÏÏJ) might hold whenever S(pb) exceeds S(p), i.e., when the right side is greater than logD, 
but we know of no compelling or even plausible argument to this effect. What is clearly 
needed is a study of what can be achieved using alternative methods of encoding, and until 
that has been carried out it seems best to regard (|63|) as a lower bound for, rather than 
the actual value of, the optimal capacity for a mixed entangled state. See the additional 
comments in Sec. IIX Bl 

IX Conclusion 

IX A Summary 

We studied the problem of dense coding using a partially-entangled pure state whose 
Schmidt rank D can be different from the dimension D of the noiseless quantum channel 
used to communicate from Alice to Bob, both for deterministic protocols in which a maximum 
of Ld messages can be sent with perfect fidelity, and also for unambiguous protocols in which 
message x is faithfully transmitted with a probability t x . 

In the deterministic case we considered uniformly-entangled states for D < D, where for 
completeness the previously published encoding protocol for sending Ld = DD messages was 
included in Sec. IIV Al and for D > D, where we showed in Sec. IIV Cl that Ld is actually 
less than D 2 , unless D is a múltiple of D, and if it is a múltiple of D other states besides 
one that is uniformly entangled can be used to achieve the optimal protocol. For pure states 
that are not uniformly entangled, our principal result is the inequality (J2b|) bounding Ld in 
terms of the largest Schmidt coefficient Ai. The utility of this bound is confirmed by the 
fact that it is satisfied as an equality by several results and conjectures in j^j, as discussed in 
Sec. IVIH Bl We also showed in Sec. IIV Dl that a protocol which achieves Ld cannot be used 
to send additional messages in an unambiguous fashion, i.e., with a nonzero probability of 
failure. 




(63) 
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For unambiguous protocols, our main results are for D < D and the saturated case in 
which t x > for all DD messages. This implies that the encoding operation must be an 
isometry, and that allows an analysis in Sec. |V|leading to the inequality (J4"%|) which bounds 
the average value of 1/t x in terms of the smallest Schmidt coefficient À5. If in addition one 
assumes the isometries are mutually orthogonal, there is an analogous bound (jüüjl on the 
average of t x , that is, the average probability of success. For the case in which all the t x 
are identical both of these inequalities yield the same bound, and we showed that it can be 
achieved by a protocol in which Bob first uses entanglement concentration to produce, with 
some probability, a uniformly entangled state. If successful this allows use of the dense coding 
protocol for such states, as described in Sec. II V Al We also showed that for D > D it is 
always possible to send D 2 messages unambiguously, though we have no bounds comparable 
to those for D < D. 

IX B Open qüestions 

In the case of deterministic dense coding a very significant and challenging problem is to 
determine the "phase diagram" of the maximum number La of messages as a function of the 
Schmidt coefficients {Xj} of the entangled state. The work in [3] represents a good beginning, 
and it would be of interest to confirm the conjectures given there, and extend them if possible 
to more general principies determining, or at least strongly constraining, the vàlues of Lj. 
Since our inequality (J2b|) seems satisfied as an equality at certain boundary points of their 
L d regions, one can hope that it, or perhaps other inequalities yet to be discovered, might 
prové of some assistance in working out the phase diagram. A major unanswered question is 
whether isometric (unitary) encoding is always sufficient to achieve Ld, or whether for certain 
entangled states one needs to employ a more general encoding procedure. Could it be, for 
example, that the absence of cases in which = D 2 — 1 reported in jïïj reflects a limitation 
of unitary encoding? Only further study can answer that and similar qüestions. The case of 
D > D, where isometric encoding is clearly not possible, is even less well understood than 
that of D < D, apart from certain special entangled states in the case where D is an integer 
múltiple of D. Even for the simple example of D = 3 and D = 2, we do not know whether 
it is possible by dense coding to send more than the D = 2 deterministic messages allowed 
by the quantum channel itself. 

While we have identified an optimal protocol for unambiguous dense coding when the 
success probability t x is independent of x for all D D messages, assuming D < D, the case 
in which t x depends on x remains open; we do not even have a bound on the average 
probability of success, aside from the information-theoretic (|14j) . The bound (J48j) on (1/r) 
requires t x > for all x, and is unlikely to be helpful when some of these probabilities are 
quite small. If some of the r x are zero one needs to allow for the possibility of nonunitary 
or nonisometric encoding, which may be a diflicult task. We know even less about what 
can be done unambiguously when D is larger than D. It is always possible to send D 2 
messages unambiguously with positive probabilities, but is this the largest number possible? 
We suspect it is, but it would be nice to have a proof. 

Both in the deterministic and in the unambiguous case, and also for noisy entangled states 
as discussed in Sec. IVIH Dl a major unanswered question is whether optimal protocols can 
always be based on unitary encoding, or whether more general procedures are sometimes 
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needed. Up till now almost all studies of extensions of Standard dense coding have simply 
assumed unitary (or isometric) operations. But in very few cases are there proofs or even 
plausible arguments that this type of encoding is optimal. Of course, when studying a 
difficult problem it is often a good strategy to make various assumptions which allow one 
to calculate an explicit result, and leave till later the problem of justifying them. We in no 
way wish to undervalue work of this type (including our own) and the insights which have 
emerged from it. Nonetheless, one should not forget that more general forms of encoding 
exist, and that in particular situations they might lead to better transmission of information. 
Exploring the encoding process is itself an interesting problem which might make a significant 
addition to our understanding of the bàsic quantum principies underlying dense coding. 
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A Appendix. Kraus Rank and Density Operator 

We will show that for the framework used in Sec. |ül see Fig. ^ the Kraus rank of the 
encoding operation (Jlj). which is to say the number of linearly independent {A^} operators 
for a fixed x, is equal to the rank of the density operator p c t describing the state available 
to Bob for measurements. In what follows, x is always fixed. It could be omitted from the 
notation, but retaining it clarifies the connection with formulas in Sec. |H] 

Let the state resulting from unitary time evolution, Fig. ^ be 

m = (W x ®I b )(\g°) ® |$)) =J2 X j\ hl ) ® ( A *iW)) ® (A.l) 

jl 

It gives rise to a reduced density operator 

p cb = Tr h (\V)(Sr\) =MMt, (A.2) 

where 

M = J2 X j [Axi\a j ) ® (h l \ = (Ic ® $)K (A.3) 
ji 

is a map from Ti h to Ti c <8> $ is defined in (JÏÏ7J), and 

K = Y,[M*')®W j )] (h l \, (A.4) 
ji 

maps Tih to Ti c <8> Ti a . The argument that identifies the Kraus rank with the rank of p c b 
proceeds in three steps: (i) the rank of K is the Kraus rank; (ii) the rank of M is that of K; 
(iii) the rank of MM^ is the rank of M. 
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For the first step, note that if {|c m )} is an orthonormal basis of TC C , the matrix elements 
of K are 

(c m a j \K\h!) = (c m \A xl \a j ). (A.5) 

By definition, the rank of K is the number of linearly independent column vectors, labeled 
by /, in its matrix, but this is obviously the same as the number of linearly independent 
matrices (c m \A x i\a^), again labeled by l, which by definition is the Kraus rank. 

That the ranks of M and K are the same is a consequence of the fact that J c <S> $ in (jA.3|) 
is a nonsingular map (rank D D) from 7í c <g> 7~í a to TC C (g> H-b- That multiplying a matrix by 
a nonsingular matrix leaves its rank unchanged is a Standard result of linear àlgebra, as is 
the equality of the ranks of M and MM'; see, for example, p. 13 of [7j. This completes the 
argument. 

What is perhaps a more intuitive approach to this result can be given using atemporal 
diagrams; see the very similar argument in Sec. VC of ^2] with reference to Fig. 13 of that 
paper. Translated to the present context, the essential observation is that W x in Fig. Q with 
\g°) fixed may be considered a map from Ti h to Tí c ® 7í a — thus K as defined in ÍJA.4|) — and 
the Kraus rank can be identified with the rank of this "cross operator." 



B Appendix. Extensions of Orthogonal Isometries 

Lemma: Suppose a set of N mutually orthogonal isometries maps a <i-dimensional space 
to a D-dimensional space 7í c , and d < D. Then an extension of these to isometries that map 
a (d+ l)-dimensional space to a D-dimensional space cannot preserve mutual orthogonality 
unless N < D. 

Proof: Let the original and extended isometries be 

d d+1 

j n = Y;\n)(k\, K n = j2\n)(k\, (b.i) 

fc=l k=l 

for 1 < n < N. Assume both sets are orthogonal in the sense of (Jlòj) . so that for n ^ n' 

d 

Tr c (j n , 4) = J2(n\n') = o, 

k=l 
d+1 

Tr c (K n ,Kl) = J2(n\n) = 0, (B.2) 

k=l 

which means that 

«il^ + i) = (B.3) 

for unequal n and n' between 1 and N. Since the are a collection of nonzero (in 

fact, normalized) states in a Hilbert space 7í c of dimension D, they can only be orthogonal 
to each other for N < D, which proves the lemma. 

Note that in the situation discussed in Sec. IVI Cl where d = D — 1, we are interested 
in a collection of iV = (D — 1)D orthogonal isometries, and since D — 1 is 2 or more, the 
lemma shows that they cannot be extended to orthogonal isometries mapping a D- to a 
.D-dimensional space. 
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